-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add OCR acction with tessercat #3074
base: master
Are you sure you want to change the base?
Conversation
Thanks for the PR, but please always discuss the new features before spending time implementing it. Also it would be faster for you if you search among the open and closed issues to see if a topic is discussed. In my opinion this PR is unacceptable for few reasons:
I suggest we keep your PR (at least partially) for when the plugin system is implemented, as some of the parts (e.g icons) can be used in the PR. I would also like to know the opinion of other devs (e.g @veracioux , @hosiet , @ZetaoYang ) as the above does not necessarily reflect the stance of the project as a team, but rather just my opinion. |
To be completely honest, the implementing time was nearly none, I needed the feature, and I'm currently using it in a more effective and usable way than piping commands. Also, this project, despite being an amazing tool, lacks of any OCR capability and, for the past nearly 3 years, since the first request to implement this came (#720) AFAIK no one has done really any code to implement the feature in a usable way for any kind of user. AFAIK plugins system is only a concept, nothing implemented.
Let me, greenshot, sharex and snagit disagree, OCR is indeed a great and wanted feature for screenshot tools. In my humble opinion, here is a PR to allow 90% of your users to do what they need today. You can easily think about moving it to plugins when the plugin system is implemented. That being said, I have no need for this PR to be merged, I can live with my fork, just wanted to give something back to the community, I guess anyone who needs it can take it from my copy of the project. Regards! |
This comment was marked as off-topic.
This comment was marked as off-topic.
I think this argument can also be applied to the currently implemented post-processing (draw shape, pen, pixelate, ...) capabilities. I use flameshot not because it's the only software that takes photos of my screen. I use it because it's feature rich and I really miss the OCR feature. |
Flameshot is a screenshot annotation tool. All those features you mentioned are necessary for annotating screenshots. OCR has nothing to do with annotation.
Then please go and support the plugin system with which this feature will be available. This was a unanimous decision by all devs to only allow such features as plugin. One of the reason for this is extremely high maintenance burden of such features on all supported operating systems.
They only support Windows. We support Linux, Mac, Windows and even BSD.
For now, the policy of maintainers is to keep it screenshot and annotation specific. Any other gizmo (watermark, printing, GIF, OCR, barcode reader,...) should only and only be allowed through a plugin system. This is to help us keep this project maintainable and prevent it from bloat. But that said, Flameshot is a FLOSS tool (emphasis on "free" and "libre"), therefore, anyone is legally allowed to maintain their own fork based on the restrictions imposed by the license of Flameshot. Also they are all Windows-only tools and are not cross-platform, or commercial in case od Snagit.
You are right, it will allow some some users do OCR today, but maintaining it will be an everlasting burden on devs. You kindly provided a PR but you are not taking the responsibility to take care of this part of the code for ever. Do you? I also don't think 90% of Flameshot users are in need of OCR. We don't have telemetry, so we don't have statistics on our user count, but I bet the number you should aim for is less than 1%.
True, it is not implemented yet (at least not yet landed on the main branch). But:
I thank you again for your PR. But I think you can also agree that in general:
|
Yes...? So does tesseract. Please just close this issue instead of mentioning "create your own fork", "submit a PR" or other FOSS clichés to end-users ;-). |
I agree with @mmahmoudian To keep bloat to minimum, ensure low maintenance burden and provide high cross-platform compatibility, it just doesn't make sense to have advanced tools like OCR as part of the core project.. Plugin system (plug and play) is a very reasonable solution. From my experience, even highly maintained projects like Tesseract are prone to providing bad results without tuning. Using plugins would ensure flexibility (freedom to use alternatives) and the ability to tweak via platform specific tools, rather than having to re-compile the entire project. In other words, offloading effort of specific features (like OCR) would benefit more (in terms of quality/updates) from specialized projects and dedicated maintainers, rather than overloading this project This really sums up the reason why I enjoyed this PR in the first place:
For clarity, alternative utilities such as ShareX are clogged with unnecessary tools.. I hope Flameshot never gets to that state. |
The only reason I've kept this PR open is due it's simplicity and straightforward implementation. I am hoping that we can somehow incorporate this PR as an add-on so that @rsrdesarrollo can maintain this add-on instead a a full fork. At the end of the day, I'm one person in a team and I should not and cannot act alone. @debuglevel This is a fully volunteer-based FLOSS project. We are not getting paid to serve you, nor we owe you anything! This is our project and we maintain and manage it as we see fit. Lastly, be civil; we have zero tolerance policy for trolls. |
Yeah, therefore just close the issue so everybody knows this is a "won't fix". Keeping it open does not help anybody but just attracts more people who hope this will get implemented. |
@debuglevel clearly OCR functionality isn't out of the question yet, potential support boils down to implementing it in a reasonable way that wouldn't overload this project
WorkaroundFor now, see: |
This PR adds an OCR action using tesseract. Fix #702
It needs tesseract to be in path or configure full path